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Abstract. We prove new lower bounds for the upper tail probabilities of suprema 
of Gaussian processes. Unlike many existing bounds, our results are not asymptotic, 
but supply strong information when one is only a little into the upper tail. We present 
an extended application to a Gaussian version of a random process studied by Halasz. 
This leads to much improved lower bound results for the sum of a random multiplica- 
tive function. We further illustrate our methods by improving lower bounds for some 
classical constants from extreme value theory, the Pickands constants iJc, as a —> 0. 

1. Introduction 

Let T be a non-empty set, {Q,J^,F) a probability space, and for each t G T let 
Zit) be a random variable defined on {fl,J^,F). Suppose that for any finite subset 
{ti,t2, ■■■,tn} ^ 7", the random variable Z(t„,)) has an ra-variate normal dis- 

tribution. We will then say, a little loosely, that {Zit)}t^j- is a Gaussian process with 
parameter set T. We refer the reader to the book of Lifshits [H] for a general introduction 
to the theory of Gaussian processes. 

In this paper we will be concerned with sup^g-y- and in particular with giving 
lower bounds for the probability that it is quite large. Results of this type have many 
applications, and the author's interest in them stems from a number-theoretic prob- 
lem that will be described later. For overviews of results in this area we refer to two 
important books, by Leadbetter, Lindgren and Rootzen [7j and by Piterbarg |14] . 

Suppose that T is a finite set, so that sup^g7-Z(t) is certainly a genuine random 
variable, and 

P(sup Z{t) > u) 

ter 

is the probability that a multivariate normal random vector takes values in a certain 
subset of R*''". We will also be interested in processes with infinite index sets, but will 
study these by looking at suitably chosen finite subsets of points t. Unless the mean 
vector and covariance matrix of {Z{t)}tsT have special forms it is typically very difficult 
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to compute the tail probability exactly. However, it turns out that as m — )■ oo, any 
correlations amongst the Z{t) that are not perfect ±1 correlations have an increasingly 
negligible effect on the tail behaviour. (This follows e.g. by the method of comparison, 
which we discuss more below) . In fact something like this remains valid when #T — > oo 
along with u, provided that the "new" Z(t) that are introduced are not too highly 
correlated. Readers familiar with e.g. Berman's theorem (as expounded in chapter 4 of 
Leadbetter, Lindgren and Rootzen [7]) should find this reasoning familiar. 

Piterbarg [H] describes several asymptotic methods for analysing Gaussian processes, 
including the method of comparison, Pickands' method of double sums, and Rice-type 
methods based on calculation of moments. He presents a quantitative version of the 
method of comparison, in which results are formulated for fixed u; but these results 
involve unspecified constants that appear to depend on {Z{t)}t^j-j so one must wait 
for u to be sufficiently large (in an unspecifed sense) before they come into pla}'^. No 
quantitative formulation of the method of double sums or the method of moments 
(for lower bounds) is attempted, and the general philosophy of these methods, that one 
need not analyse correlations of {Z{t)}t^j- except for extremely large correlations, seems 
unsuited to obtaining results for fixed u. 

One can also use metric entropy/capacity methods, such as Sudakov's minoration, to 
bound EsupjgT- Z{t). See section 14 of Lifshits [9]. Together with suitable concentration 
inequalities, such as that of Borell/Sudakov-Tsyrelson, this yields explicit lower bounds 
on P(supj£7- Z(t) > u) for fixed u. However, the lower bounds that one obtains for 
K supf^-j- Z (t) are typically off from the truth by a multiplicative factor, and then the 
lower bounds for F{snpf^j- Z{t) > u) are very far from the truth for moderately sized u. 

In this paper we develop an alternative approach to lower bounding the upper tail 
probability. The ingredients are an initial conditioning step, followed by a comparison 
(in the sense of the method of comparison) with a "model" Gaussian process that 
can be explicitly analysed. The resulting bounds are clean and can be non-trivial for 
moderately sized m, which is very important in our two applications. 

We begin with the following straightforward result. 

Proposition 1 (Conditioning Step). Let {2'(tj)}i<j<„ he jointly multivariate normal 
random variables. Write Vij := KZ{ti)Z{tj) , and suppose that: 

• (centralisation and normalisation) EZ(tj) = and EZ(tj)^ = 1 for all 1 < i < 

n; 



^We present some normal comparison results in §3, and these may give an idea of the necessary size 
of u. When the author tried to study our §6 example by these methods, he could only show that the 
supremum there is larger than about logloga;/-\/2 with high probability (by studying points t with 
spacing l/y/\ogx). Our Corollary 2 establishes that this supremum is larger than about log log a; with 
high probability. 
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• (no repeated variables) \rij\ < 1 whenever i ^ j. 
Then for any u >0 and any H > 0, 

P(max Z(ti) >u)> = > inf P(m, h), 

' m=l 

where P{m, h) is 

and the Vj = Vj.m are centralised, normalised, jointly multivariate normal random vari- 
ables with correlations 



We give the short proof of Proposition 1 in §2. The author had a more involved proof 
of (a result like) Proposition 1, based on a "reversal of roles" in the normal comparison 
procedure. Since we will need some normal comparison results later, we present these 
in §3 and give a very brief description of the reversal of roles approach as well. 

We now turn to the problem of what we will be able to say about P(m, h). If the 
correlation structure of {Z{ti)}i<i<m is arbitrary, the answer may be essentially nothing, 
in which case our attempt to give lower bounds will be at an end. However, under some 
conditions on the correlation structure we can be more optimistic, and to illustrate this 
we formulate the following result. 

Proposition 2 (Comparison Step). Let u > 0, and suppose h is sufficiently small that 
the upper bounds on the Vj in the definition of P{m, h) are non-negative. Suppose there 
exist numbers cj = Cj{m, h), dj = dj{m, h) > such that: 

(i) Cj/dj is a nan- decreasing sequence, 1 < j < m — 1; 

(ii) Cmin{i,fc}C^max{j,fc} is a stuct lowcr bound for rj^k - rj,mrk,m, for each pair 1 < 
j. A; < m — 1. 

Then for any 5 > 0, 



Pirn, h) > / -^dt ■ n $ 



where B{5) — 5-y/|^^mini<j<j„_i " ^^gj $ denotes the standard normal cu- 

mulative distribution function. 

We will prove Proposition 2 in §4, by explicitly constructing a collection of Gaussian 
random variables with the lower bound correlation structure suggested by the Cj,dj, 
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and applying a Brownian motion maximal inequality to analyse those. The reader 
might think of this procedure as pulling out some of the dependence among the Vj, 
to be analysed non-trivially using the maximal inequality. By doing this we gain the 
subtracted terms Cjdj in the product, which will be very important, at the fairly small 
cost of introducing the factor involving B{6) (and the multiplier (1 — S)). 

The reader might wonder where the numbers Cj,dj are to come from, and whether 
the lower bound obtained will not be hopelessly small in situations of interest. In fact 
we can quickly deduce the following from Propositions 1 and 2. 

Theorem 1. Let {2'(tj)}i<j<„ be as in Proposition 1, and suppose further that the 
sequence is stationary, i.e. that rj^k = — k\) for some function r. Let u > 1, and 
suppose that: 

• r{m) is a decreasing non-negative function; 

• r(l)(l + 2n~^) is at most 1. 

Then P(maxi<i<„ Z(ti) > u) is 

» { 1- / W} n * ("^^ ^ ° i^irhm) ) ) • 

where the implicit constants are absolute (in particular, not depending on {Z(ti)}i<i<n) 
and could be given explicit values. 

Theorem 1 follows by choosing H = u~^, 5 = min{M~^, ^yr(l)/u'^(l — r(l))}, Cj = 
'^j,m = ''"d"^ ~ il); dj = 1 — rj^rn = 1 ~ ^Hf^ ~ j\) the preceding propositions. In this 
case if we did not have Cjdj in the denominators in Proposition 2, then a/1 — r{j) would 
need to be replaced by a/(1 — r(j))/(l + r(j)) in the product. We do not actually use 
the theorem in this paper, as our examples require slightly different parameter choices. 
However, a reader familiar with classical limit theory for suprema of stationary processes 
(see e.g. chapter 4 of Leadbetter, Lindgren and Rootzen [7]) may find it instructive to 
compare with those results. We may not expect to obtain precisely sharp bounds from 
Theorem 1, because of the factor min{l, a/(1 — r{l))/u'^r(l)}, but it will supply good 
bounds provided u is large enough that the product term is ^ 1. For given r{j) this 
may be a much weaker requirement on u than in proofs of the classical results, which 
rely on normal comparison inequalities. 

We now move on to our two examples, which we hope will illustrate the usefulness of 
Propositions 1 and 2. In the theory of Gaussian processes, much attention has been paid 
to (mean zero, variance one) stationary processes whose covariance function satisfies 

r(t) = 1 -C|t|" + o(|t|") ast^O, 
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where C > and < a < 2. In particular, a 1969 theorem of Pickands [12] de- 
scribes the asymptotic behaviour of suprema of such processes: if /i > is fixed, and if 
suPe<f</i T'(t) < 1 for all e > 0, then 

hm e'^'/2Mi-2/"P( sup Zit) >u) = 

o<t<h v27r 

where is the so-called Pickands constant. In a second paper [13], Pickands used a 
result like this to determine the limiting distribution, as T — i- oo, of a scaled version 
of 

supo<t<T ^(^)- The scaling in that theorem thus involves Ha- See e.g. the paper of 
Shao [T7] for further discussion of the role of Ha. 

It appears that not very much is known about the size of Ha . Burnecki and Michna [1] 
describe as "mathematical folklore" the conjecture that Ha = l/T{l/a), but this is only 
known for a = 1,2. Bounds are available more generally, for example Shao [T7] used a 
representation of Ha in terms of a non-stationary process, and various techniques from 
Gaussian process theory, to show that 

_ e-i/"(l + -)) <Ha< a^/"(2.41 a/8.8 - a log(0.4 + 2.5/a) + 0.77v^)2/" 
4 a 

when < a < 1, and other bounds when 1 < a < 2. D§bicki and Kisowski [2] 

subsequently improved the upper bound on the range 1 < a < 2. D§bicki, Michna and 

Rolski [3] proved that 

" fl)V« <Ha, < a < 2, 



and in a 2009 preprint Michna [10] improved this by a multiplicative factor of 2. Note 
that, since r(l/a) ~ v^27ra(l/ea)^/'^ as a — )• 0, this is a much stronger bound than that 
of Shao [17] under that limit process. 

Applying our methods, in §5 we improve the lower bound results as a — )■ 0. 

Corollary 1. Uniformly for < a < 2, Ha ^ ^JoLieajT)^!'^ . 

This differs from the conjectured value l/r(l/a) by (essentially) a factor a2~^/", 
as a — )■ 0. One could work carefully here to obtain a completely explicit result, and 
thereby obtain non-trivial bounds for a away from zero as well. 

For our main example, we give a detailed study of the following process: 

^ cos(tlogp) 

where the summation is restricted to prime numbers p, are independent standard 
normal random variables, and ^ IS cL further (large) parameteio. 

^Roughly speaking, we are interested in the supremum of this process over 1 < t < 2, obtaining 
probability bounds that are a function of a;. As a; is increased, the supremum becomes stochasticahy 
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The motivation for studying this is its connection with a number-theoretic problem 
of Wintner [TH] • Let ep be a sequence of independent Rademacher random variables, (so 
that P(ep = 1) = P(ep = —1) = 1/2), and construct a "random multiplicative function" 
from these, as follows: 



fin) 



Y[p\n if n is squarefree 
otherwise 



We also set M{x) := ^„<2./(^)- One can view f{n) as a heuristic model for some 
deterministic functions occurring in number theory, such as the Mobius function, or 
simply as an interesting object in its own right. In our first appendix, we reproduce 
an argument of Halasz [S] showing that lower bound information about the supremum 
of a certain Rademacher process can be translated into lower bound information about 
|M(x)|. Exploiting this connection, Halasz [5] established that there exists a constant 
-B > such that, almost surely. 



M(x) ^ O(v^e-^^^°eiogxiogiogiogx^ as a; ^ oo. 

In §6, we use Propositions 1 and 2 to prove results like the following: 
Corollary 2. As x ^ oo, 

3p ^T/ilufogl ^ ^ ~ log X + O ( (log log log x) ) ) 



sup 



p 



p<x 

is 0((logloglogx)~^/^). 

The exact nature of the bound 0((logloglogx)~^/^) is not the important feature 
here: this could be improved a bit by fairly small changes to the proof, but any bound 
that is o(l) would suffice for us. The crucial feature is the level log log x — log log log x + 



0{ 
od 



log log log x)'^/'^)) that we know the supremum will typically exceed: standard meth- 
^ show that the supremum is at most log log x + log log log x (say) with probability 
1 — o(l), so Corollary 2 is very precise in this respect. This precision is crucial if one 
wishes to deduce things about |M(x)|: indeed it is the size of the second order sub- 
tracted term log log log x, together with the size of the interval over which the supremum 
is taken, that determines what can be said. 

Together with a suitable version of the multivariate central limit theorem, given in 
our second appendix. Corollary 2 allows for a substantial improvement of Halasz's [S] 
result about M(x). However, it is possible to do better still. 



larger: this is because the variance increases for fixed t, but also because the correlation at nearby 
values of t decreases. Ultimately, for reasons explained in §6.3, we allow the range of t to increase 
slowly with x as well. 

■^The process is "almost" stationary, as explained in §6.1, and a simple adaptation of Rice's formula 
yields upper bounds for its supremum. See e.g. the book of Leadbetter, Lindgren and Rootzen [7] for 
discussion of Rice's formula. 



BOUNDS ON THE SUPREMA OF GAUSSIAN PROCESSES 



7 



Corollary 3. Let A > 2.5, and let M{x) be the summatory function of a Rademacher 
random multiplicative function, as above. It almost surely holds that 

M{x) ^ 0{^{hg\ogx)'^). 

Corollary 2 implies Corollary 3 with the restriction A > 3, and this is proved in §6. 
To prove Corollary 3 for all A > 2.5, an argument by contradiction is needed to slightly 
sharpen the result of Proposition 2. This argument is rather fiddly, and for example 
does not result in a direct analogue of Corollary 2. It is given in §7. 

It seems extremely likely that, almost surely, M{x) ^ 0{^/x), and perhaps that M{x) 
almost surely has fluctuations of order a/x log logx (by analogy with Kolmogorov's Law 
of the Iterated Logarithm). Indeed, M{x) might well exhibit even larger fluctuations, 
since its probability distribution may have rather heavy tails : see, for example, the 
author's article [6]. However, an argument like our own, ultimately based on studying 
a certain average of M{x), seems unable to detect these large but rare fluctuations. 

We presented Proposition 2 in its current form, involving parameters cj, dj, 5, because 
this seems both easy to appreciate, and to lead to good results in our applications. 
However, as mentioned above, to prove the full version of Corollary 3 it is necessary 
to slightly strengthen Proposition 2. Such a strengthening may also be possible in the 
context of Corollary 1: some of the initial steps of the §7 argument transfer to that 
situation, but it is not clear whether the whole argument goes through (except that it 
does not trivially do so). 

The author also believes that there will be other Gaussian processes to which Propo- 
sitions 1 and 2 could usefully be applied, and hopes that the reader might have some 
examples at hand. 

2. Proof of Proposition 1 
In view of the decomposition 

n 

P(max Z{ti) > m) = V F{Z{t„,) > u, Z{tj) < m Vj < m - 1), 

l<i<n ' 
m=l 

it will suffice to show that, for any 1 < m < n and any H > 0, 

¥{Z{t^),...,Z{t^^i)<u,Z{tm)>u)> j= inf P{m,h). 

y In 0<h<H 

It is well known (and easy to check, by computing correlations) that Z{tm) is inde- 
pendent of the collection of random variables 

Z{tj) - rj-^Z(t„), 1 < j < m - 1. 
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These have mean zero and correlations 

and, in particular, none of them are degenerate (by assumption in Proposition 1). Thus 

P(Z(ti), Z(tm-i) < u, Z{tm) > u) is at least 

ru+H e~^^/^ 
/ P(Z(tj) — rj^rnZ{tm) < U — Tj^mX VI < j < m — 1) -j^^dx, 

Ju ' ' v27r 

from which the proposition follows. 

Q.E.D. 

In our applications, it will turn out that 

P(Z(ti), < U, Z{tm) >u + H) 

decreases very rapidly as H increases. Indeed, we will always choose H so that its effect 
in P{m,H) is negligibly small, and therefore only really need to understand P(m, 0). 
This is the point of introducing the initial decomposition of P(maxi<j<„ > u), 

rather than trying to understand F{Z{ti) < m VI < i < n) directly by conditioning. 



3. Normal comparison results 

3.1. Classical comparison results. In this subsection we present the equality under- 
lying normal comparison results, and state some fairly classical consequences of this. We 
will use these in a few places, and hopefully they will also supply an unfamiliar reader 
with some idea of how the method of comparison (as it is referred to by Piterbarg [13]) 
is traditionally employed. Our treatment largely follows Li and Shao [8], although we 
would also like to draw attention to a 1954 paper of Plackett [15], which contains a 
similar presentation of the basic comparison result]^. 

If a, 6 G M", write a < 6 to mean that every component of d is at most the corre- 
sponding component of b. We have the following identity, which is the key part of the 
proofs of various normal comparison results: 

Exact Formula 1 (Following Li and Shao, and others). Let X = (Xi,...,X„) and 
W = [Wi, ...,Wn) be centralised and normalised n-variate normal vectors, with non- 
singular covariance matrices Var(X) = (Cov(Xj, Xj))i<ij<„ = {rlj) and Var(W^) = 
(rf'j) respectively. Letu^W^. Then 

P(X <u)- ¥{W <u) = 

^Plackett was interested in the numerical approximation of multivariate normal probabilities, but some 
later comparison results are readily obtained from his paper. Unfortunately this work does not seem 
to be very widely known. 
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<i<j<n 

where ZC^') = {zf'\ zlt^) is multivariate normal with covariance matrix 



1< j<j<n 



(rg)) := War(X) + (1 - /i)Var(iy), 
and (j){x,y;r) denotes the standard bivariate normal density with correlation r, viz. 



=e 



To prove the formula one writes 

/■I d 

F(X <u)- F(W <u)= — P(Z(^) < u)dh, 

Jo dh 

observing that 

l<i<j<n '-" i,j l<i<j<n °° ^-^ 

Here fh is the density function of Z'-'^^ the range of integration has its obvious meaning, 
and the second equahty uses the fact that 



h 



which follows by expressing the multivariate normal density in terms of its characteristic 
function. 

Exact Formula 1 provides rigorous support for the intuitive idea that distributions 
with "nearby" covariance matrices may have like behaviour. The inequalities that we 
derive next may express this in a more striking way: they are a composite of results of 
Li and Shao [8] and of Leadbetter, Lindgren and Rootzen [7], although in most respects 
are unchanged from bounds of Slepian (1962), Berman (1964, 1971), and Cramer (1967). 

Comparison Inequality 1 (Following Leadbetter, Lindgren and Rootzen, and Li and 
Shao). If X, W, u are as in Exact Formula 1, and 1 denotes the indicator function, then 
each of the following is an upper hound for P(X < n) — P(iy < -it). 

(i) 



l<i<j<n ^ ^ 

(ii) 

^ y 1 („)(arcsin(r(V)-arcsin(r(°)))e-("?+^')/2(^+-^^^l^'.^'l'l'^Sl» 

l<i<j<n 
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[lllj 

.(1)| |^(0)|1^3/2 



2 (1 + maxjlr; , I, |r) y II) ' ,2, 2\,r,n, r, (i), , (o) 



^ («f+«|)/2(l+max{|r|yi,|r,^;;^|}) 

''''J / 2 1 2\ /1 ri (1)1 I (0)n 

i<i<i<n [ul + Mj) ^ 1 - maxj |r^^ | , \r\> \ } 

To obtain the first bound, we overestimate the conditional probabihty in Exact For- 



mula 1 trivially by 1 and insert the definition of (f){ui,Uj;r^^h, observing that 



1 e-K-2<';'"^«i+^')/2(l-(^*J )') 1 



./T_r,g))2 y/l-(rg))2 



For the second bound, we overestimate the exponential by e"^"*'^"^'*/^^^'*'™^^''-'^''^ '''^^.j '^^ 
and then evaluate the integral over t. Alternatively, by making a substitution x = 
1^(1 — + 1) we find that for any < a < < 1, and any > 0, 



1 cy/{l-a)/{l+a) . 



a y/1 — t"^ J y/{i-b)/{i+b) 1 + a;- 



n _L M3/2 i'y/{l-a)/(l+a) 



Vl - bK JyJ{i-b)/{i+b) 

Since this integral is at most e~^*-^^^^/^*-^^''\ the third bound follows directly. 

As Leadbetter, Lindgren and Rootzen [7] point out, the assumption that X and W 
are non-singular is not necessary for the above bounds, as one may pass to that case by 
making arbitrarily small changes to the entries of the covariance matrices, and the first 
bound (from which we derived the others) is a continuous function of those entries. 

Typically, one would apply Comparison Inequality 1 by observing that the covariance 
matrix of X "looks rather like" the covariance matrix of a well understood multivariate 
normal distribution, e.g. that it looks like the identity matrix. See the paper of Li and 
Shao [S] for some examples. If the entries of the covariance matrices are sufficiently close 
together, or if one can afford to choose the entries of u very large, then Comparison 
Inequality 1 can supply strong information. 

We finish with a well known qualitative consequence of Comparison Inequality 1. 

Comparison Inequality 2. Let X = (Xi, ...,Xn) and W = {Wi, Wn) be centralised 
and normalised n-variate normal vectors, with covariance matrices Var(X) = (r^^j) and 
Var(W^) = (rf'j) respectively. Let n G M". If rf'^- < rf^ for each 1 < i,j < n, then 

¥{X <u)< F{W < u). 
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The special case of this result where u = {u,u, ...,u), for some m G M, is usually 
referred to as Slepian's lemma. 

3.2. Reversal of roles. As promised in the introduction, we now give a very brief 
description of the reversal of roles argument that originally served in place of Propo- 
sition 1. For the applications in this paper, Proposition 1 entirely supersedes such an 
argument, but it is possible that it may be useful in other contexts. 
We aim to give an estimate for 

P(Z(t„) > u, Z{tj) < M Vj < m - 1), 

under the conditions of Proposition 1. Our idea is to apply the methodology of Exact 
Formula 1, but viewing the sum of integrals that arises as a main term for subsequent 
analysis, and the subtracted probability as an error term. Thus we do not choose W to 
have a standard distribution, but so that this subtracted probability is zero. 

More concretely, we let Ai,...,Am be a collection of A^(0, 1) random variables, all 
independent of one another and of the Z{ti). Let e > 0, and define 

X; = Wi := / l<i< m-1; X^, := -; W„ ' 



VTT^ VTT^ VTT^ 

Precisely analogously to Exact Formula 1, and adopting the same notation r^'^j as there, 
we find that 

F{Xi,...,X^^i<u,Xm>u)-F{Wi,...,W^.i<u,Wm>u) 

l<j<»n-l 

— / Hu,u; — )P{t,h,e)dh, 

l<i<m-l "T ^0 "T 

say. We need the e perturbations here to ensure that we work with non-singular mul- 
tivariate normal distributions. However, at the end of the argument we can let e — )■ 0, 
whereby we will have compared F{Z{tm) > u, Z{tj) < m Vj < m — 1) with 

F{Z{ti), < U, Z{tm-l) >u)=0. 

It is less straightforward to analyse P{i, h, e) for 1 < i < m — 2 than to analyse P{m — 
1, /i, e), and to give lower bounds one can replace those probabilities by Ir^^^nrn-i- 
our examples these other terms give a lower order contribution, but this need not always 
be so. However, to analyse P(m — 1, /i, e) one can note (as did Li and Shao [8]) that for 
any 1 < i < m — 1, the collection of random variables 
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is independent of {Z^'^\ Z^^}. In our examples this leads, after some slightly fiddly 
manipulations, to a probability estimate much like Proposition 1. (For, in our examples, 
Z(tjn-i) and Z{tm) are always very highly correlated, and so P{m— 1, h, e) is essentially 
the same as the simple conditional probability in the proof of Proposition 1.) 



4. Proof of Proposition 2 

In view of Comparison Inequality 2, and assumption (ii) in the statement of Propo- 
sition 2, we may proceed on the assumption that for 1 < j. A; < m — 1 and j ^ k, ^VjVk 
is equal to 

Cimn{j,k}dr[iax{j,k} 

The key to the proof is the explicit construction of such random variables from a col- 
lection of independent normal random variables. 

Let Yi, ...,Yn,Zi, ...,Zn be independent standard normal random variables, and for 
1 < i < n let ai, Pi he real numbers satisfying 

j<i 

Then the random variables 



are again jointly multivariate normal, have zero means and unit variances, and satisfy 

fe<min{ij} 

We also note that ii ui, ...,Un are any real numbers, if > VI < i < n, and if 5 e R, 
then 

F{Xi < VI < i < n) = F\Zi< \ ^^^^^ VI < i < n 
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We now set n = m — 1, and define real numbers a^, /3j by 



A := ^^^^=, Z^"i-=3"' 1<^<"^-1- 



1 - rf^ j<i 



The conditions on q, cij in Proposition 2 ensure that we can define a^, /3j in this way, and 
that they satisfy the various hypotheses above. The reader may also check that the Xi 
have the correlation structure that we wanted, and that the product term in the previous 
paragraph is as in Proposition 2 (when ui is taken as {u — ri,m{u + h))/ y^l — rf.^. It 
remains to eive a suitable lower bound for P(Ej<i < ^ VI < i < m - 1). 

It should not come as a surprise that the behaviour of partial sums of independent 
normal random variables is rather well understood. For example, writing {iyf}t>o for 
the standard Brownian motion, (see e.g. chapter 5 of Lifshits |9] for much discussion 
of this process), one has the following neat result, which we quote from chapter 13.4 of 
Grimmett and Stirzaker if t > 0, then 

maxiy, = \Wt\ = |A^(0,t)|. 

0<s<t 

This is useful to us because (^j<j aj^)i<j<m-i = {Wj2.^.a'^)i<i<m-i, so that 

6u., 



P(^ ajYj <^Vl<i<m-l)> $(fi) - $(-5) 

3<i 



where 



■'j<m-l j 

as claimed in Proposition 2 



B = — = mm 7r = "l/ i^i^i ; • 

2 l<i<m-l fii y Cm-1 l<i<m-l di 



Q.E.D. 



The proof just given divided naturally into two parts: first we constructed the Xj 
to explicitly model the Vj, allowing us to extract some of their dependence in the 
manageable form of the Yf, and then we analysed the Yj using a result about Brownian 
motion. Both of these steps could conceivably be improved, potentially leading to a 
better lower bound for P(m, h). 

In the analysis of the Y^, we used a fact about the probability that a Brownian 
motion remains below a constant level for a period of "time" t. We could have used 
results about the probability that it remains below, for example, a sloping line, allowing 
some fiexibility in the upper bounds that we ask for. However, in our applications 
these probabilities are never particularly small, and the author doubts that a more 
complicated approach would be advantageous in many situations. 
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It appears to the author that the modelhng part of the argument is weaker. Thus, 
in our examples, our lower bound Cmin{j,fc}'^max{j,fc} for ^ — m'^fc.m is not very tight 
when j and k are close together. An alternative way to think about this is to note that 
we can replace the independent Zj in our construction by any standard normal Aj with 



EAjAk < 



The correlation bound here looks complicated, but this may be somewhat illusory; for 
example, if we were able to make the choices Cj = rj^^i dj = 1 — rj^^, as for certain 
stationary processes, we would want 

a/(1 - rj-m)(l - rk,m) 

These quantities are not likely to be easier to work with than the correlations rj^k of our 
original random variables. However, to prove Proposition 2 we need upper bounds for 
upper tail probabilities (which then lower bound the probability that none of the Aj are 
too big), and these may be easier to come by than lower bounds, for example by using 
Rice's formula as part of a first moment argument. Another approach to improving 
Proposition 2 along these lines is worked out in §7. 



5. Application to estimating Pickands' constants 

Suppose that ti < t2 < ... < tM is a set of equally spaced real numbers. Suppose, 
moreover, that {Z(ti)}i<i<M is a mean zero, variance one, stationary Gaussian process, 
with decreasing covariance function r(t), t > 0. If a > 0, then (the proof of) Proposition 
1 implies that 

P( max^ Z{ti) >u) > MF{Z{tM) > u, Z{tj) <u\/j < M) 

^ Me .^g-a-aV2n^ j^f P(M,h). 

J2-KU Q<h<a/u 



In a paper from 1996, Shao considers a (mean zero, variance one) stationary 
Gaussian process indexed by the half-line [0, oo), with covariance function 

r{t) = ^(e"*/2 + e-*/2 _ _ ^-t/2y^^ ^ > q_ 



BOUNDS ON THE SUPREMA OF GAUSSIAN PROCESSES 



15 



Such a process exists for each fixed < a < 2. As t — ?■ 0, we see (as did Shao [17j) that 
r{t) = 1 - r/2 + 0{t^). We also note that, for t > 0, 

r'(t) = -(e"*/2_e-"*/2_(e*/2 + g-V2)(gV2_g-V2)a-i) 

< 0. 

In proving Corollary 1, we shall assume that a is smaller than a certain fixed positive 
constant, which could be found explicitly if desired and should certainly be less than 
1. There is no loss in doing this, because Ha ^ 1 for a larger than such a constant. 
To prove the corollary, we will study Shao's stationary process at the sample points 
ti = i/M. Making the simple choice a = 1 in the above discussion, we have 

Ha > 2^/" lim (Mu~2/a p^^^^ 

u^oo 0<h<l/u 

and we will investigate the largest value of M, depending on u and a, for which we 
can show that info</i<i/u P(M, /i) ^ 1. Note that a large value of M corresponds to a 
close packing of sample points in the interval [0, 1]. The reader should also note that 
there is nothing intrinsically asymptotic about most of our calculations, although we 
are interested in letting -u — > oo to compare with Pickands' theorem. 

We want to apply Proposition 2, and can do so with the natural choices 

cj = r{{M-j)/M), d, = l-r{{M-j)/M), 1 < j < M - 1, 

since r{t) is decreasing and positive. Thus P{M, h) is at least 

= - $(-5)) n ( (1 + - 6)^u'{r/2M- + 0{f/M^))] , 

where B = B{5) is as in Proposition 2, and 5 will be chosen later in terms of a. 
Together with the known and conjectured bounds for Pickands' constants, this suggests 
taking M = [{bu'^a/2y/°'], where now we investigate how large b may be chosen. For 
definiteness in our calculations, we declare that we shall certainly have 1 < 6 < 10 (and 
of course our conclusion will be that taking 6 as e/2 is permissible). 

First we note that the part of the product over j > M^^^ is 1 + o(l) as m — )■ oo. For, 
since r{t) is decreasing and a, 6 are small, each of those terms is at least 



((1 + 0{a))u{l - (5)-^l-r(M-3/4)) > ^((i/2)^u^{M-^"/y2 + 0(M-3/2))). 
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If u is large enough, this is 

> 1 _ g-(l/8)«2(M-3^/V2+0(M-3/2)) ^ ^ _ g-VS 



and clearly (1 — e v^)*^ is 1 + o(l) as m — )■ oo with a fixed. 

When j < M^/^, provided that m is large enough in terms of a < 1 we see 

u^f/M^ < u^M~^/^ = 0{u-^a-^/^), and M'^/uT = 0{alf). 

Since for x > 2, say, we have > 1 — x~"'^e^^^/^ > e~^^ the part of the 

product over j < M^^^ is at least e~f^^'^'"''^\ where 

Note that, since we assume that a and 5 are small, the arguments of $ in the product 
are all at least 2. Now 

/.Ml/4 



j<Mi/4 



e ^aj/ 



s ((r^)"°"-'°-'r(i/.). 



In view of Stirling's formula, the right hand side is asymptotic to \J2'K / a{2h/ e{l—5YY^°' 
as a — )■ 0, so is at most 4a~^/^(26/e(l — 5)^)^/", say, when a is small. 
Finally, observe that 

B{5) = 6^{l-r{l/M))/r{l/M)u{l + 0(1/^2(1 - r(l/M)))) > 6/^/^, 

provided that u is large enough in terms of a. If we make the choice 6 = a, then b 
can be chosen as large as e/2 whilst still ensuring that f{b,6,a,u) = 0(1). Corollary 1 
follows from making these choices. 



6. Application to a number-theoretic process 

6.1. Preliminary calculations. Before we can apply Propositions 1 and 2 to our sec- 
ond example, we must reduce to studying a finite set of sample points t, and determine 
the covariance structure of the corresponding random variables. As might be expected, 
variants of some of these calculations appear in Halasz's paper although we need to 
be more precise in several places. 

It is useful initially to ignore the contribution from "very small" primes to our random 
sums. Let y be a parameter, later to be chosen as a suitable function of x. It is immediate 
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that if s, t e M, then 
, cos(tlogp) cos(slogp), cos (t log cos (s log 

H 5'p 1/2+1/ log X ■ 2^ pl/2+l/logx ^ pl+2/logx' 

y<p<x yi^p^x yi^pi^x 

1 cos((i + s) logp) + cos((t — s) logp) 

~ 2 2-^ pl+2/logx ' 

?/<p<3; 



For t G M, we let denote the normalised random variable 

S.<P<x 9p COs(t logp)/pl/2+l/log. ^ ^^^^^^ logp)/y/2+l/logx. 



^.<P<xCos2(tlogp)/pi+2/iog. ^E,<p<.l/2pi+'/i°^^ + E,<p<.cos(2tlogp)/2pi+2/iog. 

By a strong form of the prime number theorem (see e.g. chapter 6 of Montgomery 
and Vaughan |llj) we have 

7r(z) := #{p < ;z : p is prime} = /" -^ + 0(^6-'^^^), z > 2, 

J 2 ^OgU 



where > is a certain constant. Then if z < x 



-—^= / -——d7i{u) = / — rfM + c(x) + 0(e-'^^] 

= log log z + 0(1), 
where c(x) depends on x only. Moreover, if a 7^ 0, 

y ^^^(";"g^) = r cos(alog.)n-V'°^- ^^ ^ |a|)e-^^) 

^ pl+2/logx / ulogM I 17 7 

cosfaw) , /-^"s- cosfaw^ 



logy •''logy 

+0((1 + |a|)e-^^^) 

^^du + 0{—^) + 0((1 + laDe-'^v^), 
aiogy u a log a; 

where the third equality follows using integration by parts, since ^((e"^"/^"^^ — = 
0(1/ log^ x) for logy <u< \ogx. We deduce that if s, t > 1, and s ^t, then 

ftlZ fs) = " M*+^)iogy^ v|t-s|iogx7 vv ' ; ; 

/; ..../.t.iog. + O(i^) + 0((t + s)e-^v^) 

We now set out the specific situation to which our Gaussian process results will be 
applied. Let E' > 1 be a further parameter, (to be chosen later as a function of x), and 
for n G N U {0} and M < logx/E introduce the sets 

Tn = Tn,.,EM := {2n + 1 + iE/ logx : 1 < I < M} C [2n + l,2n + 2]. 
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We seek lower bound information about maxo<„<B sup^^^^ fo^^ certain B. 

At this point the reader may be rather appalled by the number of parameters around, 
so we hasten to point out that most of these will "select themselves" in due course, and 
can essentially be ignored. The sets Tn are sufficiently separated that the behaviour of 
Zy{t) on different sets is roughly independent (see §6.3). Moreover, up to error terms 
the correlation KZy{t)Zy{s) depends on s,t through \t — s\ only (i.e. our process is 
approximately stationary). Thus we focus on understanding sup^g^^ Zy(t), and defer 
thinking about larger values of n until we put our results together in §6.3. 

The parameter E controls the spacing of sample points within their blocks Tn, and 
in §6.2 it will be chosen as small as possible such that we obtain good probability 
lower bounds from Proposition 2. We declare for now that we shall certainly have 
E < e^^°siogx^ gg^y -yy-g ^ould like to take M as large as possible, but to simplify our 
calculations we choose M — [log x/XE' logy], where K is an absolute constant that 
forces 'EZy{t)Zy(s) > 1/ log log x, say, for t,s e % (see below). The parameter y is 
present to get rid of "beginning of series" effects, in particular ensuring that we have 
enough independence of Zy{t) over different blocks Tn- It will be selected in §6.3, but 
we declare for now that we shall certainly have logx <y< e(^°sioga:)i°° 

In the above set-up, if s, t e To are distinct then 

floe X 



Jlt—sliosy u ^ 



^Zy{t)Zy{s) = '1*"^'^°^%" . Mt-^|10g.^ 

" " ly^TTv^kj^u Vlogy log logx 

_ J\t-s\logy u +0 



V log log 

log(l/|t - s\ logy) ^ ^{ 1 



log log X — log log y \ log log X 
6.2. Implementation of Propositions 1 and 2. We order the points of To in the 
obvious and natural way, writing = 1 + iE/ logx, 1 < i < M. We aim to show 
that the maximum of our original random sum is about log logx, and the standard 
deviations that we normalised by are about (log log x — log log y) / 2, so we take u — 
\/ 2 (log log X — log log y) . Then 



u{l - rm-i,m) = ©(log-E/Vioglog^^^Ioglogy) = Q{\ogE/u), 
so we can safely make the canonical choice H — l/u'm Proposition 1. 
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We now seek to apply Proposition 2 to give a lower bound for P(m, h), where 1 < 
m < M and h < H. Let j < k < m — 1. If \j — k\ < log^^^ x then 



log log X — log log y \ log log X J ' \ log log x 



log log X 



In fact this is also true when \j — k\> log^^^x: since J^^^y{cosu/u)du is a decreasing 
function of < a < 1/ logy, we have 

'^^''^ r 1^,/,^" . — V log y log log a; 

Jy 1/1+2/ logic log u \ tof to to 



1 



.logy log log ' 

and Tj^m > 1/ log log a; > 1/logy because m < M. This means that it is legitimate to 
choose 

iog(m-j)i; + o(i)_ iog(m-j)E + o(i)_ 1 

log log X- log logy log log X- log logy 

in Proposition 2. Setting 5 = 1/ log log x, to match the size of the other "big Oh" term, 
we discover that 

s(i/iogiogx) = e ( y,^^^] = e ^ ^ 

It follows from all of this that 

P{m,h) » /!p^n$f(l + 0( ^ ))V21og(m-i)£; 
log log X ■^j^ \ log(m — jjii/ 

log log a; ' 
provided always that E is larger than an absolute constant. Making the choice E 



(log log x)^/^/ \loglogx 



VTogToglr, the exponential becomes 0(1), and we find P(m, h) ^ v^log log log x j log log x. 
Plugging this lower bound into Proposition 1, it follows immediately that 

^/ ^ / N /—r. — \ \ — \ M-v/log log log xe""^/^ 

P supZ, i > V2 log log X- log logy » \\ 

teTo M log log X 

^ yiog log logx 
(log log x) 2 

6.3. Exploitation of the lower bound. The lower bound obtained at the end of §6.2 
is useful raw information about {Zj^(t)}(g7^,. However, in order to deduce results about 
the summatory function M{x) of a random multiplicative function, (as described in the 
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introduction), we need to be able to say that the supremum is large with probability 
close to 1. 

To do this, our idea is to "sample the supremum several times independently": since 
the probability that the supremum is large is not too small, if we sample a few times it is 
very likely that we will obtain a large value. Although we do not have lots of independent 
copies of {Zy{t)}, we can achieve something like this by considering {Zy{t)}t£rn fo^ 
different n. If Be''^^^ < say, then for distinct l<s,t<2B + 2we have 

flogx cosndu + 0( i ) / 1 



r i^o/i*^" n V log V log log X / ' 

as at the end of §6.1. For such s,t with |s — i| > 1, the calculations in §6.1 supply a 
more precise result, namely that 

' 

|r — s| log y log log x log log x 

Thus, by the second part of Comparison Inequality 1, 

|P( max supZ^(i) < V2(loglogx- log logy))- JT F{sup Zy{t) < V2(log log x - log logy)) | 

0<n<B f(zT te^T 
- - tfc/n 0<n<B ^^'■^ 

< Y Y \^Zy{2i + 1 + kE/ log x)Zy{2j + l + lE/ log x)\ 

loo" T ^ ^ 

^ 0<i<j<Bl<k,l<M 

V.2 „, n/r2 



log^xloglogXp^^^^ Vb-j|logy J 

(log log x)^ V logy / 
We have seen that, at the level of precision required in §6.2, the correlation struc- 
ture of {Zy(t)}t£Tn is the same for each < n < B. Thus our calculations concern- 
ing supjgT-g Zy(t) go through for sup^^j-^ Zy{t) as well, and P(maxo<„<B sup^g^-^ Zy{t) < 
^2(logloga;- log logy)) is 

^ 1 f BlogB ^ ^3g-dVl5i^^ _^ g-e((B+l) Vlog log log x/{log log 

(log log x) 2 V logy j 
The right hand side is O(e-0(Viogiogiog^)) if we take B = (log log x)^, and y > logx. 



For our application to M{x), we need a version of the above result in which Zy{t) is 
replaced by 

Ey<p<. fiP) COs(t logp)/pV2+l/log. 



^E,<p<.COs2(nogp)/pl+2/log. 
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with f{p) independent Rademacher random variables. This can be achieved using a 
muhivariate central limit theorem, as explained in our second appendix, at the cost of 
replacing a/2 (log log ,t — log logy) by a/2 (log log ,t — log logy) — 1. In the application 
of the central limit theorem, we need y to be at least a certain power of logx, say 
y = log^ X. This choice is also permissible for all of the preceding calculations. 
Finally note that for fixed i e R, 

p<y p<y 
as in §6.1. Applying Chebychev's inequality to this estimate, it follows that 

> aoglogloga;)^/^) = 0((logloglogx)-V^), 

p<y ^ 

similarly if the Qp are replaced by Rademacher random variables /(p). The behaviour 
of these sums is independent of the behaviour of the sums over y < p < x, so that 
Corollary 2 quickly follows from our bounds. 

As discussed in our first appendix, the tail sum J2p>xf(p) cos{tlogp)/p^^'^'^^^^°^^ is 
almost surely convergent, and in fact it converges in square mean, so that 

E( V MS!^<n2^)2 < I ^ o( r - ) = 0(1) 

p>x ^ p>x ^ "^^ ° 



We temporarily set d{x) := infi<i<2(iogiogx)2 WE(Ey<p<x /(^^) cos(Uogp)/pV2+i/iogx)2^ 



so that d{x) = \/ (log log a; — loglogy)/2 + 0(1) by the calculations in §6.1. Applying 
Chebychev's inequality again, together with the Rademacher version of our estimate for 
Zy{t), we find that 

^/ 1 fip)cOs(t \ogp) /— ; ; ; (log log log x)^/^ , 

^ ^ V2(.ogiog.-iogiog.)-i^ ' ' ) 

is 0((loglogloga;)^^/^). Then applying the First Borel-Cantelli Lemma, one quickly 
deduces that: for any fixed A > 3, there almost surely exists a sequence {xk), tending 
to infinity, with 

^ f{p)cOs{t\ogp) ^111 ^11 /II 1 1 

sup 2^ 1/2+1/ log Xfe 2 - Xk-A log log log Xk . 

l<*<2(logloga;fe)2 ^ P 

By the argument in our first appendix, (and specifically by Supplementary Lemma 1 
from that appendix), this implies Corollary 3 for A > 3. 
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7. Refinement of Proposition 2 for the random multiplicative 

functions application 

As discussed at the end of §4, Proposition 2 may be refined in that the product term 
can be replaced by any lower bound for 

for any standard normal random variables Aj satisfying 



It will be convenient to write U{j, k) for this upper bound on the permissible correlations. 
By assumption about the numbers Cj, dj, we always have U{j, k) > 0. 
In our application to random multiplicative functions, U {j, k) is at least 



^(1 - dm - Cjdj)^^ - - Ckdk) ^J{1- - Cjdj) (1 - ^ - Cfe4) 

-log\j -k\E + Oil) ^ log\j - n,\E + Oil) 

y/{\og \j - m\E + 0(l))(log |A; - m\E + 0(1)) ^^(log \j - m\E + 0(l))(log \k - m\E + 0(1)) ' 
for 1 < j < A; < m — 1. It seems sensible to consider intervals U /E < |m — j|,|m — A;| < 
U~^^/E, (with L < 2 a parameter to be chosen), on which we see 



ilogL \ilogL^ 
In the random multiplicative functions example, the upper bound {1 — S){u — Tj^miu + 
h))/ ^1 — r|„^ — Cjdj for the Aj on such an interval is at least (l+0(l/i logL))i/2i logL. 
So considering intervals independently, we can replace the product in Proposition 2 by 

[log(£;m)/logL] 

TT nAj < (1 - -^)^/2i]^ \fUlE <\m- 3\ < U+'/E), 

I log L/ 

i=0, ° 
Li>E/2 

where c is an absolute constant, and Aj are any standard normal random variables 
whose correlations are bounded as described. 

The crucial point is that on each interval, and up to the "big Oh" term, the bound on 
U {j, k) corresponds to a stationary correlation structure that we can hope to understand. 
Indeed, it is essentially a re-scaled version of the original correlation structure of our 
random multiplicative functions process. 



Using these ideas, we shall establish the following result: 
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Proposition 3. If E is a sufficiently large constant, then the following is true. Let 
{Zy(t)}t^-j-o = {^y{'t)}t£TQ ^6 the Gaussian process described in §6.1, for such a choice 
of E. Let e{x) be any function tending to zero as x ^ oo. Then for some sequence of 
X, tending to infinity, we have 



P(supZ.W > V2(loglog.-loglog,)) > ^g^l^. 



Here we include a superscript x to explicitly record that Zy[t) — Zy{t) depends on 
X, and we remind the reader that we had y — log^ x. 
Recall from §6.1 that 



\og{\3 - k\E) + 0{l] 



, l<J,/c< 



log a; 



log log X — log log y KE log y 

When is larger than an absolute constant, we can choose x{i) e R such that 

1 - 



V2(logloga;(i) - \og\ogy{i)) 



i log L 



)V2ilogL. 



Here we write y{i) = y{x{i)) — x{i). Then if C e N is larger than an absolute 
constant, and we take L — 1 + we have 

log(|i-fc|E)+logC + 0(l) 



1 + ^) 



<U{j,k), 



y^^^ \ogx{i)' y^^^^ ' loga;(i)' ^ (1 - c/(i log L))2i log L 

where U{j,k) denotes the bound for interval i. This only makes sense if jC,kC < 
\ogx{i)/KE\ogy{i), but that will hold for example if j,k < L^/KEC'^. Then if i is 
sufficiently large that 



we can say P(^j < (1 







'U' 


< 


D 


E 






KEC\ 



ilogL 



)^/2i hgl \/U/E <\m- ] \ < / E) is at least 



P(supZ;g(i) < V2(loglogx(i)-loglogy(i))). 



Notice that, for our fixed choice of L. the various requirements for i to be "sufficiently 
large" will all be satisfied ii i> + D, where is least for which L* > E/2, and D is 
an absolute constant. Thus the product term in Proposition 2 may be replaced by 



n* 



f log(Bm) 1 
logL 



:i + C'(^-^))V21ogj^) n P(sup^;g(i) < V2(loglogx(i) -loglogy(i))). 

^■^ ^ i=iE+D 

We also note that, obviously, x{i) tends to infinity with i. 

Now suppose that the proposition failed, so for all sufficiently large x the tail prob- 
ability was smaller than required. Then for all i from some point onwards we would 
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have 



n^^pZlm < V2(loglog:.(i) - loglog !,(,))) > l- (,„gi„g^(,))3/2 



> 1-0' ' 



(i log L) 3/2 

so the product term in Proposition 2 could be replaced by a positive constant. But then 
the argument of §6.2 would supply that 



nsnp Zy{t) > V2(loglogx-loglog|/)) > 

ieTo E(loglogx)^/^ 

which is a contradiction for x sufficiently large. 

Q.E.D. 



Armed with Proposition 3, we can repeat the argument of §6.3 with E chosen to be a 
large constant (rather than i/log logx), and B then chosen as (log log a:;)^/^ log log log x, 
say (rather than (log log x)^). The reader should note that there is a subtlety involved, 
as this requires lower bounds for 

P(sup Zy{t) > v/2(loglogx-log logy)), 0<n<B, 

t£T„ 

whilst Proposition 3 concerns sup^g-^j Zy(t) only. However, modifying the choice of E 
and K by some multiplicative constants in the definition of 7^, n ^ 0, so that E is 
larger but EK remains the same, we can arrange using Comparison Inequality 2 that 

P(sup Zy{t) > v/2(logloga;-loglog2/)) > ¥ {snp Zy{t) > v/2(logloga; - loglog?/)). 

We also only have probability bounds for a sequence of x tending to infinity, rather than 
all X, but we do not require that in §6.3. Corollary 3 follows from these considerations. 



Appendix A. Random multiplicative functions and Rademacher 

PROCESSES 

In this appendix we sketch the connection between the summatory function M{x) 
of a random multiplicative function (as defined in the introduction), and a certain 
Rademacher random process. The argument we give is essentially that of Halasz p]. 

In view of Wintner's [18] result that for each e > 0, M{x) = 0{x^^'^~^'^) almost surely, 
we know that the Dirichlet series 



n=l 
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is almost surely convergent in the half plane > 1/2, and then satisfies 

Miz) , 
F{s) = sj^ ^dz. 

On the other hand, writing C,{s) := l/n*,3?(s) > 1 for the Riemann zeta function, 
we have the Euler product identity 



F{s) = n(i 



pS 



oEp /(p)/p=-Ep i/V^+Efc>3 Ep(-i)'=+V(p)Vfcp'=" 



gEp /(p)/p''-iogC(2s)/2+Efc>2 Ep i/2fcp2fc«+j:^^3 Ep(-i)'=+V(p)Vfcp*''_ 



This is certainly valid when 3ft(s) > 1, and almost surely extends to 3?(s) > 1/2 in view 
of Kolmogorov's T.„ee Series thecenfl and the identity theorem of complex analysis. 
Thus in the domain l/2<cr<l,l<t<2, say, we almost surely have 



z>i ^z{a- 1/2) z>zo V^icT- 1/2) ' 



where the second inequality follows by splitting the integral at zq := e^^v^^^. Taking 
a = 1/2 + 1/ logx, where x > 2 is a parameter, we find that 

eE,/(P)cos(nogp)/pV.+v.o..^^^^^^p^l^^j^^^ iMpi^ l<t<2. 

z>l z>e^/^°S^ 



For the proof of Corollary 3, we need a version of the preceding inequality that is valid 
for a larger range of t. Using the estimate \ \ogC{a+it)\ < loglog |t|+0(l), cr > 1, \t\ > 2, 
which is contained in e.g. Theorem 6.7 of Montgomery and Vaughan [11], we can say 
that for t > 1, 



z>l V ^ 2>e^'°s^ 

This immediately implies the following result: 

Supplementary Lemma 1. Let g{z) be a decreasing function. If, with positive prob- 
ability, we have M{z) = 0{y/zg{z)) as z ^ oo, then with positive probability we have 

t>i 

for all X > 2. 



Since Halasz's paper [5] seems to be difficult to get hold of, it is perhaps worthwhile 
to briefly discuss Halasz's [H] own use of the foregoing argument. He shows that there 

^This implies that J2p fip)/p'^ converges almost surely when 3fi(s) > 1/2. We then use the standard 
fact, proved using partial summation, that a Dirichlet series is a holomorphic function strictly to the 
right of its abscissa of convergence. 
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almost surely exist sequences of real numbers x^, tending to infinity, and of sets Sk ^ 
[1,2], of measure > l/logx^, and of sets C [1,2], of measure < l/logx^, such that 

^ /(p) cos(tlogp) ^ iQgiQg^,^ _ ^ 29 log log Xk log log log Xk Vt G S^, 

p<xk 

E^, ,COs(tlogp) ^/ X COsftlogp) ^, z , r, „nv ,^ 

/(p) - E /(p) ^i/2;i/io;.; = Q(yi^^i^) vtG[i,2]\5,. 

In particular, there almost surely exists a sequence such that 
sup y2 f (P) ^1^/2+ ml^^ - log a^fc - a/29 log log Xk log log log - O ( a/ log log ) , 

which is enough to imply the omega result for M{x) attributed to Halasz in the intro- 
duction. 

Very roughly, Halasz [5j investigates the process X]p<x /(p) , t G [1,2] by 

estimating moments of the counting function 



2 

where M is a parameter. However, the details are rather complicated, as it is actually 
necessary to split the sum over p into several ranges, and then reduce the range of 
integration to progressively smaller random subsets of [1,2]. This splitting is, in a 
sense, quite natural, as the parts of the sum taken over large primes are less correlated at 
nearby values of t (see §6.1). On the other hand, the splitting causes an accumulation of 
error terms in the analysis, one from each range of summation. The iterative approach 
is also highly reliant on being presented with the process as a random sum over p, 
whereas (at least if the f{p) were independent Gaussians) one might just as well be 
given a description of the process only in terms of its covariance structure. 



Appendix B. A multivariate central limit theorem 

In this appendix we discuss a multivariate central limit theorem of Reinert and 
RoUin [T6]. We view this as a "universality result", which sometimes lets us transfer 
conclusions about suprema of Gaussian processes to conclusions about the suprema of 
corresponding Rademacher processes. Reinert and Rollin's [T6] approach is based on 
Stein's method of exchangeable pairs. 

Suppose that T is a finite set, and that ai{t) E M. for 1 < i < n and t E T. Suppose 
also that (ej)[Li is a sequence of independent Rademacher random variables, and that 
{9i)i=i is a sequence of independent standard normal random variables. We wish to 
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approximate the (joint) distribution of {Xt}teT by that of {Yt}terj where 

n n 

Xt := ^ ai{t)ei, Yt := ^ ai{t)gi. 
1=1 1=1 

In the typical way, we construct random variables X'^ such that {{Xt)teTj {^t)teT) 
is an exchangeable pair of vectors. (That is, such that the law of this tuple is the 
same as the law of ((Xj')(g7-, {Xt)teT)-) Let J be a random variable having the discrete 
uniform distribution on {1, 2, ra}, independently of everything else; and let (e'J"^]^ 
be an independent copy of (ei)"=i. We define X^ as follows: conditional on the event 
{I = i}, set 

X'f- = Xt- ai{t)ei + ai{t)e[, t eT. 

The reader may check that the exchangeability property does then hold, together with 
the following regression property: 

n 

With a view to applying Theorem 2.1 of Reinert and Rollin [16], we calculate two 
further quantities: 

n n 

n{X't-Xt){X'-Xs)\{X^)ueT) = - V«.(t)a.(s)E(«-e,)'|(X„)„er) = - V «.(t)«.(s); 

n ^ — ^ n ^-^ 

i=l 1=1 



E\{X'-Xt){X'-X,){Xl-X^)\ = -J2\a^{t)a,{s)a,{u)\E\e'-e,f = - ^|a,(t) 



aAs aAu 



n ^ — ' n 

i=l i=l 



The reader should notice that, whilst we did not use the fact that the are Rademacher 
random variables up until this point, in the first calculation it allows us to conclude that 
the left hand side is deterministic. This means that one of the error terms in Reinert 
and RoUin's [16] theorem is identically zero; indeed, if h : M*^ — t- M is a three times 
differentiable function, and if the covariance matrix of {Xt)teT is non-singular, their 
theorem implies that 

d^h{x) 



\Eh{{Xt)teT)-m{Yt)teT)\ < \ sup 



dxsdxtdxu 



\ai{s)ai{t)ai{u)\. 



s,t,ueT i=i 

The condition that the covariance matrix should be non-singular is evidently unneces- 
sary here (at least if h is bounded, say), since we can ensure this by introducing #T 
dummy random variables whose coefficients ai{t) have absolute value at most 6, and 
then let 5 0. 



Specialising to our random multiplicative functions application, we would like to 
choose h to be the indicator function of a box in M*^, but this would not satisfy the three 
times differentiability condition. Reinert and Rollin devote a section of their paper [16] 
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to this "unsmoothing" problem, but the results they obtain are rather involved, and in 
this case we can easily overcome the difficulty directly. Let s : R ^ [0, 1] be a three 
times differentiable function satisfying 



1 if 2; < A/2(logloga; — log log |/) — 1, 



if ^ > ^/ 2 (log log X — log log y) . 

The interval on which s{z) must transition from 1 to has length ©(l/yToglogx), so we 
can find such a function whose derivatives satisfy = 0((loglogx)^/^), < r < 3, 

2; e R. Setting h{{xt)t&r) — ntgr'^(^*)' ^® conclude that 

PfmaxXj < \/2(logloga; — log logy) — 1) 

n 

< P(maxyt < ^2 (log log a; - log logy)) + 0((logloga;)^/^(#r)^ ^ max |ai(t)|2). 

i=l 

The reader may check that in the random multiplicative functions case, the error term 
on the right has order at most 

We have = {B + 1)M <C (log log x)^ log x, so this is o(l) as x — > 00 provided that y 
is at least log*^ x, say. The multivariate central limit theorem has supplied an extremely 
good bound, presumably because any individual Cp (or gp) has a very tiny impact on 
the random multiplicative function processes. 
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